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We consider cDNA microarray experiments when the cell pop- 
ulations have a factorial structure, and investigate the problem of 
their optimal designing under a baseline parametrization where the 
objects of interest differ from those under the more common orthog- 
onal parametrization. First, analytical results are given for the 2x2 
factorial. Since practical applications often involve a more complex 
factorial structure, we next explore general factorials and obtain a col- 
lection of optimal designs in the saturated, that is, most economic, 
case. This, in turn, is seen to yield an approach for finding optimal or 
efficient designs in the practically more important nearly saturated 
cases. Thereafter, the findings are extended to the more intricate sit- 
uation where the underlying model incorporates dye-coloring effects, 
and the role of dye-swapping is critically examined. 

1. Introduction. Optimal designing of cDNA microarray experiments is 
an area of enormous potential that has started opening up in recent years. 
The fields of application include the biological, agricultural and pharmaceu- 
tical sciences. In an experimental design for microarrays, the cell popula- 
tions under study represent the treatments which, as in traditional design 
theory, may be unstructured or have a factorial structure. The design is 
called varietal or factorial in these two situations respectively. In varietal 
designs, interest lies typically in all or some pairwise contrasts of the treat- 
ment effects, whereas in factorial designs, the objects of interest are the main 
effects of the factors and interactions among them. These factorial effects 
are commonly defined via an orthogonal parametrization, but a relatively 
less studied baseline parametrization, which is nonorthogonal, can also be of 
interest depending on the context. The main distinction between these two 
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kinds of parametrization is that the former defines the factorial effects via 
mutually orthogonal treatment contrasts, whereas the latter defines these 
effects with reference to natural baseline levels of the factors and, hence, 
entails nonorthogonality. More details follow in Section 3. 

In a pioneering paper Kerr and Churchill (2001a) discussed the design is- 
sues in microarrays and investigated optimal varietal designs that estimate 
the pairwise contrasts of treatment effects for fixed genes with minimum av- 
erage variance. While observing that microarray designs can be considered 
as incomplete block designs with block size two, they noted the inadequacy 
of the classical optimality results in this regard and obtained, via complete 
enumeration, economic optimal designs for ten or fewer treatments. We re- 
fer to Kerr and Churchill (2001b), Yang and Speed (2002) and Churchill 
(2002) for very informative further discussion on the design issues. While 
Kerr and Churchill (2001a, 2001b) and Churchill (2002) concentrated on 
varietal designs, Yang and Speed (2002) discussed factorial designs in some 
detail. Subsequent work on varietal designs for microarrays includes those 
due to Dobbin and Simon (2002), Kerr (2003), Rosa, Steibel and Tempel- 
man (2005), Wit, Nobile and Khanin (2005) and Altman and Hua (2006), 
although some of these authors, as also Churchill (2002), briefly touched 
upon factorial designs as well. 

Work on factorial designs for microarrays started gaining momentum only 
very recently. A major reference in this regard is Kerr (2006), who worked 
under the framework of the orthogonal parametrization and explored two- 
level factorial designs that keep all main effects and two-factor interactions 
estimable, without any assumption on the absence of higher order interac- 
tions. Exploiting the connection between microarray designs and incomplete 
block designs with block size two, she showed how replicates arising from 
different blocking schemes can be combined for this purpose and, in partic- 
ular, gave designs with the minimum number of replicates for eight or fewer 
factors. Related references in the general context of two-level factorials in 
blocks of size two include Yang and Draper (2003) and Wang (2004) and, 
interestingly, Kerr (2006) proposed a construction which is more economic 
than that in Wang (2004) for any number of factors. Bueno Filho, Gilmour 
and Rosa (2006) also considered factorial microarray designs for both fixed 
and random treatment effects. Their parametrization for fixed treatment 
effects is akin to the orthogonal one and, among other things, they stud- 
ied optimal designs for the 3x3 factorial, paying more attention to the 
case where the two- factor interaction is absent. Further results on facto- 
rial microarray designs under the orthogonal parametrization were obtained 
by Landgrebe, Bretz and Brunner (2006), Gupta (2006) and Grossmann 
and Schwabe (2008). Landgrebe, Bretz and Brunner (2006) studied optimal 
designs within a collection of candidate designs and focused on the 2x2 
and 3x2 factorials, while Gupta (2006) investigated the role of balanced 
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factorial designs. Grossmann and Schwabe (2008) explored optimal designs 
for models that include only the main effects or only the main effects and 
two-factor interactions. 

Turning to factorial designs for microarrays under the baseline parametriza- 
tion, which is the main thrust of this paper, two key references are 
Yang and Speed (2002) and Glonek and Solomon (2004), (hereafter abbrevi- 
ated GS). While Yang and Speed (2002) broadly discussed the design issues, 
GS reported illuminating computational results on admissible designs for the 
2x2 factorial. Continuing with the baseline parametrization, we propose to 
consider general, possibly asymmetric, factorials and present analytical re- 
sults. For comparative purposes, results under the orthogonal parametriza- 
tion will also be occasionally indicated. The present endeavor is motivated 
by two reasons. First, as will be seen in Section 3, there are practical situa- 
tions, even beyond the domain of microarray experiments, where the baseline 
parametrization is natural. Second, although this parametrization looks sim- 
pler than the orthogonal one, it renders the task of finding optimal or efficient 
designs somewhat more challenging due to lack of orthogonality. Presumably 
due to this reason, even in traditional factorial design literature, the optimal 
design problem under the baseline parametrization has received very little 
attention. It is hoped that our results would fill in this gap to some extent. 
We remark in this connection that because of the considerable difference 
in the definitions of the factorial effects under the baseline and orthogonal 
parametrizations, the significant body of work that has already been done 
on factorial microarray designs under the latter parametrization provides no 
clue to our derivation or results. The following example, concerning a study 
of leukemic mice, highlights this point; more details on some of the technical 
terms in the example are available in Sections 2 and 3. 

Example 1. GS describe a cDNA microarray experiment that compares 
two cell lines FIA and V449E at times hours and 24 hours. The cell line 
V449E proliferates into leukemia while FIA is nonleukemic. Then there are 
two factors dictating the cell populations. The first factor, namely, the mu- 
tant, has two levels FIA and V449E of which FIA, being nonleukemic, is 
taken as the baseline level. These levels are coded as and 1 respectively. 
The second factor is time, again with two levels, hours and 24 hours, and 
the first of these is taken as the baseline level. These two levels are also 
coded as and 1 respectively. Thus, considering the two factors together, 
there are four treatment combinations, 00, 01, 10 and 11, representing the 
cell populations. The main effects of the two factors are important, but their 
interaction, that concerns the differential expression of genes for V449E and 
FIA at time 24 hours as contrasted with that at time hours, is often of 
even greater interest. The experiment consists of a number of slides each 
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comparing a pair of cell populations or, equivalently, treatment combina- 
tions. Suppose the available resources allow experimentation with six slides. 
Since the four treatment combinations can also be paired in six ways, namely, 
(01, 00), (10, 00), (11, 00), (10, 01), (11,01) and (11, 10), the symmetric design, 
that compares each pair on one slide, seems very attractive, because it is a 
balanced incomplete block (BIB) design with excellent optimality properties 
under the orthogonal parametrization [Kiefer (1975)]. As a rival, consider 
the design that compares the pairs (01,00) and (10,00) each on two slides, 
and the pairs (11,01) and (11,10) each on one slide. As noted in GS, un- 
der the baseline parametrization, the symmetric design estimates the two 
main effects and the interaction with respective variances |<r 2 , |<r 2 and a 2 , 
whereas the corresponding variances for the rival design are J2& 2 , f^c 2 and 
|<7 2 , where a 2 is the common variance of the observations. Thus, the rival 
design outperforms the symmetric one not only in overall terms but also in- 
dividually for each factorial effect, in the sense of entailing uniformly smaller 
variance, that is, more precise estimation. 

The above example is revealing. It shows that even for the seemingly sim- 
ple 2x2 factorial, optimal designs under the orthogonal parametrization are 
by no means guaranteed to perform well when one works with the baseline 
parametrization where entirely new designs may turn out to be desirable. 
This opens up new challenges which become even more complex for general 
factorials. 

The paper is organized as follows. The next section gives an outline of 
the experimental setup. In Section 3 we revisit the 2x2 factorial, consid- 
ered previously by GS, and obtain analytical results which supplement and 
strengthen their computational findings, in addition to preparing the ground 
for the subsequent development. Taking cognizance of the facts that in many 
applications there may be more than two factors dictating the cell popula- 
tions and that, even with two factors, one or both of them may appear at 
more than two levels, general factorials are considered from Section 4 on- 
ward. Since cDNA microarray experiments are still quite expensive, there 
is a premium on optimal designs that are relatively small in size. From this 
viewpoint, in Section 4, we first consider the saturated, that is, most eco- 
nomic, case and present a collection of optimal designs in a strong sense. 
Apart from facilitating a choice under resource constraints, this leads to an 
approach for finding optimal or efficient designs in the practically more im- 
portant nearly saturated cases that are also studied at length in Section 4. 
In Section 5 we extend the main ideas of Section 4 to the situation where the 
underlying model includes dye-coloring effects. The findings of this section 
rigorously justify, for such a model, a recommendation by Yang and Speed 
(2002) on dye-swapping. Several other design issues, including open prob- 
lems, are discussed in Section 6. Technical details, including proofs, appear 
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in a supplementary material file posted at the journal website [Banerjee and 
Mukerjee (2008)]. The technical tools include use of approximate design 
theory, Kronecker representation and unimodularity. 

2. Experimental setup. We refer to Nguyen et al. (2002) and 
Amaratunga and Cabrera (2004) for detailed accounts of the experimental 
setup. In cDNA microarrays, each slide compares two cell populations on the 
basis of mRNA samples separately labeled with fluorescent dyes, usually red 
and green. This is done for a number of slides and different slides may com- 
pare different pairs of cell populations. After competitive hybridization, the 
ratio of the red and green fluorescence intensities is measured at each spot 
on each slide. Any such ratio represents the relative abundance of the gene in 
the two cell populations compared on the corresponding slide. The intensity 
ratios are usually adjusted for background noise and then normalized with 
the objective of removing systematic biases. 

We consider linear models for the log intensities and, hence, the log in- 
tensity ratios. The modeling as well as the corresponding optimal design 
problem refers to a single gene — it is intended that the same design applies 
simultaneously to all genes on the array. The log intensity ratios for a gene, 
arising from different slides, are supposed to be homoscedastic and uncor- 
rected; a discussion on this, in the light of biological variability, follows in 
Section 6. 

The above experimental setup is structurally similar to classical paired 
comparison experiments; see Kerr and Churchill (2001a). The cell popula- 
tions under comparison are the same as treatments (or treatment combina- 
tions when they are dictated by several factors as in Example 1), while each 
slide is equivalent to a block of size two. However, the stringency on the 
number of slides as well as the baseline parametrization adopted here open 
up new design problems. 

3. The 2x2 factorial. 

3.1. The baseline parametrization. Suppose two factors F\ and F2, each 
at levels and 1, dictate the cell populations, which correspond to the 
treatment combinations 00,01, 10 and 11. Let too,tqi,tio an d T\\ denote the 
expected log intensities, that is, the effects, of these treatment combinations. 
We focus on the situation where, as in Example 1, there is a null state or 
baseline level, say, 0, of each factor. Then #00 = Too stands for the baseline 
effect. We consider the baseline parametrization [cf. Yang and Speed (2002); 
GS] according to which the main effects of F\ and F2 are given respectively 
by 
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while the interaction effect F1F2 is given by 



(2) 



011 = Til — 710 — TQl + TOO- 



The counterparts of #10, #01 an d $11 under the more common orthogonal 
parametrization are defined respectively as 

0*0 = |(TH +T10 — TQl - Tqo), #01 = ^( T H -T10 +T01 - Tqo), 



Observe that the definitions of the main effects under the two parametriza- 
tions are entirely different. While 0\\ is proportional to 0* l5 this equivalence 
for the two-factor interaction also disappears for factorials involving three 
or more factors; see, e.g., (8) below. 

Kerr (2006) nicely summarized the situations under which the two para- 
metrizations mentioned above are appropriate. The baseline parametrization 
is natural if there is a clear null state or baseline level of each factor. As noted 
above, this happens in Example 1. Similarly, in a toxicological study with bi- 
nary factors, each representing the presence or absence of a particular toxin, 
the state of absence can be regarded as a natural baseline level of each factor 
[Kerr (2006)]. On the other hand, if at least one factor, like gender, lacks 
a natural baseline level, then the baseline parametrization is inappropriate 
because this will arbitrarily single out one level of such a factor. In situations 
of this kind, it is advisable to use the orthogonal parametrization. 

Indeed, the null state or baseline level of a factor can be interpreted in 
a broad sense. It need not strictly mean the zero level on some scale, but 
may as well refer to a standard or control level like the one currently used 
in practice. For example, in an agricultural experiment to investigate possi- 
ble improvement in productivity by changing the doses of several fertilizers, 
the currently used doses of the fertilizers may represent the control levels. 
Similarly, in an industrial experiment on possible quality improvement via a 
change in the settings of several machines used at different stages of the pro- 
duction process, the current settings of the machines may reasonably consti- 
tute the control levels. In general, if each factor has such a control or baseline 
level along with one or more test levels, then the baseline parametrization 
is appropriate and, hence, the present results should be useful. The possible 
areas of application extend well beyond microarrays and pertain, notably, 
to agricultural and industrial experiments as hinted above. We add in this 
connection that although not much work has so far been reported on opti- 
mal factorial designs under the baseline parametrization, there is already a 
rich literature on the corresponding problem for varietal designs. An excel- 
lent review of this development on treatment-control designs is available in 
Majumdar (1996). 



(3) 
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3.2. Design criteria. Following GS, in this section we assume the absence 
of systematic biases including dye-color bias because one of our objectives 
here is to obtain analytical results in their setup. With four treatment com- 
binations 00, 01, 10 and 11, as in Example 1, there are six possibilities for any 
slide, namely, (01, 00), (10, 00), (11, 00), (10, 01), (11, 01) and (11,10), where 
the members of each pair represent the treatment combinations that can be 
compared on the slide. Within each pair, one member gets red dye-coloring 
and the other green dye-coloring, but the distinction is immaterial in the 
absence of dye-color bias. Suppose the total number of slides used in the 
experiment is fixed at N. Then the design problem involves deciding on 
the respective frequencies fi,---,fe with which the slides of the six kinds 
as listed above should appear in the experiment, so as to entail optimal 
inference in a reasonable sense. Here fx, . . . , f$ are nonnegative integers sat- 
isfying /i + h /g = N. We consider only those designs that keep #oi ; $io 

and 9n estimable. Let Vbi, Vio and Vn denote the variances of the best lin- 
ear unbiased estimators (BLUEs) of #oi>#io and On, respectively. A good 
design should aim at keeping these three variances small. Recognizing that 
commonly no single design will minimize all the three simultaneously, GS 
considered admissible deigns. A design do is admissible if there is no other 
design d\ such that each of Vbi,Vio and V\\ under d\ is less than or equal 
to that under do, at least one of these inequalities being strict. By com- 
plete enumeration, GS tabulated admissible designs for even N in the range 
6<A< 18. 

The notion of admissibility is intimately linked with that of weighted 
optimality. In most applications, one wishes to give equal weight to the 
two main effect parameters. Also, as GS noted, the interaction parameter 
can be of greater interest in microarrays than the main effect parameters. 
From this perspective, we consider designs that minimize Vbi + V\q + wV\\ , 
where w is a positive weight, with particular interest in case w > 1. Such a 
design, called w- optimal for simplicity, is evidently admissible. Indeed, even 
for moderate N, admissible designs may be too numerous, and consideration 
of u;-optimality helps in narrowing down the choice. 

3.3. Results via approximate design theory and their implications. The 
fact that the frequencies fi,...,fe are integer- valued complicates the task of 
finding w-optimal designs because tools from calculus cannot be employed. 
This is particularly so because the objective function Vbi + Vio + wV\\ de- 
pends on these frequencies in a complex manner. Considerable simplicity is 
achieved if for the moment we treat the relative frequencies tti = fi/N as 
continuous variables over the range 7Tj > for each i and = 1- Any such 
7T = (m, . . . , ttq) is called a design measure. This approach amounts to invok- 
ing the approximate design theory [see, e.g., Silvey (1980)] which enables 
one to use calculus techniques to get the following result. 
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Result 1. (a) For w > 0, let 



(4) 



i = \{{w 2 + 2w) 



1/2 



w}. 



Then the design measure 




To=(5-£,5-e,o,o,e,o 



(b) The design measure ft = (4, 4,0,0, 4, 4) minimizes Vn and is admis- 
sible. 

(c) The design measure (5 — £, 3 ~~ £)0,0,£,£) is admissible whenever 



Incidentally, Bueno Filho, Gilmour and Rosa (2006) and Grossmann and 
Schwabe (2008) also employed the approximate theory in the study of op- 
timal microarray designs. But their settings and criteria and, hence, final 
results are different from ours. We now discuss the implication of Result 1 
on (exact) designs that take cognizance of the fact that fi,---,fe are actually 
integers. Any such design may be represented by the vector / = (fi, ■ ■ ■ , fe)- 
Since 7Tj = fi/N for each i, the following conclusions, pertaining to even N, 
are evident from Result 1. 

(i) If w > I and (f> = N£ is an integer, where £ is given by (4), then the 



is w-optimal. 

(ii) If TV is a multiple of 4, then the design (±iV, ± TV, 0, 0, ±JV, \N) mini- 
mizes V11. 

(iii) Any design of the form (6) is admissible whenever \N <cf)< jN. 

The points (i)-(iii) noted above cater to the need, in our context, of finding 
good designs with emphasis on the interaction parameter. For N < 18, they 
provide analytical justification for quite a few findings of GS, such as the 
admissibility of the rival design in Example 1. In addition, they facilitate 
the study of good designs for N > 20, which is beyond the range considered 
by GS and may pose difficulties in complete enumeration. For instance, if 
iV = 20, then they show that the designs (6,6,0,0,4,4) and (5,5,0,0,5,5) 
are admissible, and that the latter design minimizes Vu. 

Given w (> |), even if <fi = iV£ in (i) above is not an integer, one may 
simply round it off to the nearest integer to get a highly efficient design. As an 
illustration, let w = 2. By (4), then £ = 0.207107. For N = 22, rounding 7V£ 
off to the nearest integer, namely, 5, we can follow (6) to consider the design 



6-^ — 4- 



design 



(6) 
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(6,6,0,0,5,5), which has efficiency 99.44% as a comparison with the w- 
optimal design measure in (5) reveals. Continuing with w = 2, for every even 
N in the range 6 < N < 30, one can similarly obtain designs with over 97%, 
and often over 99%, efficiency. These efficiencies are actually lower bounds, 
as they are relative to an optimal deign measure which is unattainable in 
the exact setup. Hence, we conjecture that all these designs are actually 
^-optimal, with w = 2, for the respective N. Using (iii) above, one can also 
verify that these designs are all admissible. 

It is of interest to compare Result 1 with its counterpart arising under the 
orthogonal parametrization (3). To that effect, we note that the following 
hold under (3): 

(a) The design measure 7i"o rth = (a, a, \ — 2a, \ — 2a, a, a), where a = 
\w 1 / 2 /(2 + w 1 / 2 ), is w-optimal for < w < 4. 

(b) The design measure tt = (|, |,0,0, \, \) is ^-optimal for w > 4. 

(c) The design measure(a, a, \ — 2a, \ — 2a, a, a), with a as in (a), is 
admissible for < a < ^ . 

The proofs of these are similar to but simpler than that of Result 1. From (a) 
and (b) above, simple rounding off again yields highly efficient exact designs 
under the orthogonal parametrization. For w < 4, unlike ttq in Result 1, the 
measure 7rQ rth in (a) assigns positive masses to all the six possible slides. In 
fact, for w = l,7TQ rth assigns uniform mass everywhere and, hence, entails 
a BIB design. On the other hand, for w > 4, the optimal design measures 
under the two parametrizations are quite close to each other. 

The fact that n has only six elements, because of only six possibilities 
for any slide, helped the study of optimal design measures and, hence, that 
of optimal or efficient designs in this section. In microarray experiments for 
general factorials considered from Section 4 onward, the number of possibil- 
ities for any slide increases dramatically and, as a result, the optimal design 
measures are analytically intractable. For this reason, hereafter we directly 
investigate exact designs. 

4. General factorials. 

4.1. Preliminaries. In many applications of cDNA microarrays there 
may be more than two factors dictating the cell populations and, even if 
there are only two factors, one or both of them may appear at more than 
two levels. For instance, if in Example 1 the two cell lines are compared at 
time 12 hours, in addition to hours and 24 hours, then we have to consider 
a 2 x 3 factorial, with the second factor, time, now appearing at three lev- 
els. Similar examples abound and underscore the practical need to explore 
the optimal designing of cDNA microarray experiments with reference to 
general factorials. 
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From this perspective, consider an s\ x • • • x s n factorial that involves 
n (> 2) factors F±, .. .,F n dictating the cell populations, with Fj appearing 
at levels 0, l,...,Sj — 1. Then there are v = Y\sj cell populations which 
correspond to the treatment combinations i\ . . . i n (0 < i 3 < Sj — 1, 1 < j < n). 
Let Ti x ,„i n be the expected log intensity, that is, the effect, of the treatment 
combination i\ . . .i n . As before, the baseline level of each factor is denoted 
by 0. Hence, #oo...o = T oo...o stands for the baseline effect. Also, as an obvious 
extension of the baseline parametrization given by (1) and (2), a main effect, 
say, that of F\, is represented by the si — 1 parameters 

(7) #iiO...O = TiiO...O - T00...0 (1 < h < s\ - 1), 

whereas a two- factor interaction, say, F1F2, is represented by the (si — 
l)( s 2 — 1) parameters 

(8) ^iii 2 0...0 = ^ii 2 0...0 — r iiOO...O — T"0i20...0 + r OOO...O 

(l<«i<ai-l,l<»2<S2-l)- 

Similarly, we can define for every i\ . . . i n ^ . . . (0 < ij < Sj — 1, 1 < 

j < n) so that any such represents a factorial effect as determined by 

its nonzero subscripts. Thus, any 6i^...% n with u nonzero subscripts represents 
a factorial effect involving u factors. Hereafter, often the v — 1 parameters 
Qh...in {h ■ ■ ■ in 7^ . . . 0) are collectively referred to as the 6s for ease in 
presentation. 

Note that (7) is reminiscent of the canonical parametrization in Wit, 
Nobile and Khanin (2005), Subsection 2.1, for varietal designs. Throughout 
this section we continue to assume the absence of systematic biases including 
dye-color bias and write a 1 for the variance of any observed log intensity 
ratio. 

4.2. Optimal saturated designs. All main and interaction effects, as rep- 
resented by the 6s, are of potential interest at least for a relatively small 
number of factors. Hence, at this stage we consider optimal designs for the 
estimation of all these v — 1 parameters. Clearly, then the number of slides, 
N, in the experiment must satisfy N > v — 1. We first consider the satu- 
rated case N = v — 1 and obtain a collection of optimal designs. In addition 
to facilitating a choice under resource constraints, this paves the way for the 
development of an approach for finding optimal or efficient designs in the 
practically more important nearly saturated cases that are taken up in the 
next subsection. 

Result 2. Let N = v — 1 and consider a design that keeps all the 6s 
estimable. Then for any ^ij...^, which represents a factorial effect involving 
u factors, 

(9) Var(^ 1 ... 4 J>a 2 2«- 1 , 
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where ^...^is the BLUE of Qi x ...i n - 

Result 3 below shows that the same design can attain the lower bound in 
(9) simultaneously for all the 0s. Such a design is then optimal not only in 
overall terms but also individually for every parameter representing a main 
or interaction effect. In what follows, a slide which compares treatment com- 
binations i\ . . . i n and j\ . . .j n , respectively with red and green dye-coloring, 
is denoted by the ordered pair (i± . . . i n ,ji ■ ■ - jn)- A design is represented by a 
collection of such pairs. Note that the ordering within any pair is immaterial 
at this stage for inferential purposes, as we are now assuming the absence of 
dye-color bias. For any i\ . . . i n ^ . . . 0, let p(i± . . . i n ) be obtained replacing 
the first nonzero entry of i\ . . . i n by and leaving the other entries un- 
changed. For instance, with a 2 x 2 x 3 factorial, p(012) = 002, ^(lll) = Oil 
etc. 

Result 3. Let N = v — 1. Then the design 

d = {(ii . ..i n ,p(ii . ..i n )):0< ij < Sj - 1, 1 < j < n, h . . .i n ^ . . . 0} 

leads to the attainment of the lower bound in (9) simultaneously for all the 
6s. 

Remark 1. For n = 2, one can check that Result 3 remains valid if for 
every i\ > > l,p(iiZ2) is allowed to be either 0^ or i±0, instead of being 
fixed at 0^2 as stipulated above. Because of the two possibilities for any such 
p(i±i2), one gets a collection of 2( S1 ~ 1 )( S2 ~ 1 ) optimal designs. 

Remark 2. Examples can be given to show that, for n > 3, Result 3 does 
not remain valid if, in the spirit of Remark 1, each p(ii...i n ) is obtained 
simply by replacing an arbitrary, rather than the first, nonzero entry of 
i\. . .i n by 0. However, even then, Result 3 leads to a collection of optimal 
designs via factor permutation. To illustrate this point, observe that Result 
3 yields the design 

d = {(001, 000), (002, 000), (010, 000), (011, 001), (012, 002), (100, 000), 

(101,001), (102,002), (110,010), (111,011), (112,012)} 

for a 2 x 2 x 3 factorial, and the design 

{(001,000), (010,000), (011,001), (100,000), (101,001), (110,010), 

(111,011), (200,000), (201,001), (210,010), (211,011)} 

for a 3 x 2 x 2 factorial. Permuting the factors in the latter, one readily gets 
another design for the 2x2x3 factorial which, like do, is optimal in the 
sense of Result 3. In the same manner, Result 3 can be easily applied to all 
possible factor orderings to yield a collection of optimal designs. 
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4.3. Nearly saturated designs. The optimal designs Section 4.2 are satu- 
rated and, hence, do not yield an internal estimator of a 2 which is important 
for testing of hypotheses. This difficulty can persist even if the same clone is 
replicated r (> 1) times on each slide. Then, for the purpose of estimating 
the 6s, the means of the r log intensity ratios arising from the slides play the 
role of the individual ratios considered so far, but an attempt to estimate 
a 2 on the basis of the within slide variation can be vitiated by unknown 
correlation among the ratios arising from the same slide [Yang and Speed 
(2002) and Churchill (2002)]. 

In view of the above, as a feasible yet economic approach to getting degrees 
of freedom for the estimation of c 2 , one may like to have a little more 
than v — 1 slides and thus consider nearly saturated designs. Unlike in the 
saturated case, typically for N > v — 1, no single design can estimate all 
the 9s with the minimum variance. We, therefore, consider the w-optimality 
criterion as applicable to general factorials. Analytical derivation of optimal 
designs, via either combinatorial techniques or approximate design theory, 
still remains difficult for N > v — 1. 

The results in the last subsection, however, readily yield a heuristic ap- 
proach which, as computations indicate, leads to highly efficient, if not op- 
timal, designs. Based on the intuitive expectation that, for N close to v — 1, 
a design obtained via augmentation of an optimal saturated design should 
behave well, we propose the following steps: 

(I) Given s±, . . . , s n , list all optimal saturated designs given by Result 
3 and Remark 1 or 2. 

(II) Given N, augment each design in (I) in all possible ways to generate 
designs with N slides. 

(Ill) From the augmented designs in (II), select one as per the chosen 
optimality criterion, also taking care of resource constraints, if any. 

For N close to v — 1, computationally the above steps are far easier to 
implement than a complete enumeration of all designs. The least favorable 
cases for this approach are those where the saturated designs involve a rather 
small number of slides, and hence, even a slightly larger N can potentially 
have a significant impact. From this viewpoint, the approach is now evalu- 
ated for 2x2 and 2x3 factorials which represent the two smallest saturated 
cases. We consider «;-optimal designs that, for an s\ x S2 factorial, aim at 
minimizing 

Si — 1 S2 — 1 Si— lS2 — l 

E Var(4o) + E Var(4 2 ) + ^ E E Var(0 ilia ). 

il=l 22 = 1 tl=l 12 = 1 

Tables 1 and 2 show ^-optimal designs, as obtained by complete enumeration 
of all designs, for 2x2 and 2x3 factorials, with w = 1,2,3, and N = v — 
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Table 1 

w -optimal designs for the 2x2 factorial 



N 


w 


to-optimal design 


4 


1,2,3 


{(01,00), (10, 00), (11,01), (11,10)} 


5 


1,2,3 


{(01,00), (10,00), (10,00), (11,01), (11, 10)} 


6 


1,2,3 


{(01,00), (01,00), (10,00), (10,00), (11,01), (11, 10)} 



1 = 1j2, 3). While these optimal designs can be nonunique, only one 

such design is reported in each case to save space. 

Every design in Table 1 or 2 contains the optimal saturated design do of 
Result 3 as a subdesign, thus showing that the heuristic approach, based on 
augmentation, indeed yields a w-optimal design in each of these cases. For 
the 2x2 factorial, we have also checked that all admissible designs for N = 
4, 5 and 6 are augmentations of one of the two optimal saturated designs, 
{(01, 00), (10, 00), (11, 01)} and {(01, 00), (10, 00), (11, 10)}, arising from Re- 
mark 1. 

For an s\ x S2 factorial, let d be the design obtained as the union of the 
2( s i~ 1 )( s 2~ 1 ) optimal saturated designs given by Remark 1, that is, J consists 
of the v — 1 + (si — l)(s2 — 1) slides {i\i2, 0^2 ), 1 < H < si — 1 5 < i<i < S2 — 1, 
and (iit2, *i0), < i\ < s± — 1, 1 < %2 < «2 — 1- Let £1 be the class of all designs 
that consist of N slides from d, where v — 1 < N < v — 1 + (s\ — l)(s2 — 1). 
Tables 1, 2 and partial enumeration in several other cases lead us to the 
following conjecture. 

Conjecture, (a) If N = v — 1 + (si — l){s 2 - 1), then the design d is 
ID-optimal for any w > 1. 

(b) If v — 1 < N < v — 1 + (si — l)(s2 — 1), then for any w > 1, a w-optimal 
design in O is also w-optimal in the class of all designs. 

The case N = 4 in Table 1 and the cases N = 6,7 in Table 2 pertain 
to the Conjecture and show its truth, with w = 1,2,3, for 2x2 and 2x3 
factorials. Furthermore, using approximate design theory, the efficiency of 



Table 2 

•w-optimal designs for the 2x3 factorial 



N 


w 


lo-optimal design 


6 


1,2,3 


{(01, 00), (02, 00), (10, 00), (11, 01), (12, 02), (11, 10)} 


7 


1, 2, 3 


{(01,00), (02,00), (10, 00), (11, 01), (12,02), (11, 10), (12, 10)} 


8 


1 


{(01,00), (02,00), (10,00), (11, 01), (12, 02), (11, 10), (12, 10), (02,01)} 


8 


2,3 


{(01,00), (02, 00), (10, 00), (10, 00), (11,01), (12, 02), (11, 10), (12, 10)} 
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the design d in (a) is seen to be at least 91.88%, 94.16% and 95.57% for 
the 2 x 4 factorial, and at least 94.26%, 96.16% and 97.04% for the 3 x 3 
factorial, under w = 1,2 and 3 respectively. Indeed, if this Conjecture is true 
in general, then part (b) would considerably reduce the search for an optimal 
design, while part (a) would give a compact result. 

For general factorials, one can define the orthogonal parametrization via 
a straightforward extension of (3); see, for example, Gupta and Mukerjee 
[(1989), Chapter 2]. Under this parametrization, BIB designs are optimal in a 
very strong sense [Kiefer (1975)] and extended group divisible (EGD) designs 
are known to be admissible [Gupta and Mukerjee (1989), Chapters 3 and 8]. 
While Example 1 demonstrates that a BIB design can become inadmissible 
under the baseline parametrization, we now show that the same can happen 
with EGD designs. Note that in the context of microarrays, an EGD design is 
one where the number of slides comparing any two treatment combinations 
i\...i n and j\...j n depends only on the equality or otherwise of i u and 
jui 1 < u < n. Thus, for the 2x3 factorial and with N = 6 slides, there 
is a unique EGD design {(11, 00), (12, 00), (10, 01), (12, 01), (10, 02), (11, 02)} 
that allows the estimability of all treatment contrasts. Under the baseline 
parametrization, this design becomes inadmissible because it estimates each 
of the 9s with uniformly larger variance than the w-optimal design shown 
in Table 2 for N = 6. 

5. Results under effects due to dye-coloring. We now extend the main 
ideas of Section 4 to the situation where the underlying model includes effects 
due to dye-coloring. For < ij < sj — 1, 1 < j < n, let i and i be the 
expected log intensities for the treatment combination i\ . . . i n under red and 
green dye-coloring respectively. Then r^...^ = ^{P^ i + fi\ f i n } represents 
the overall effect of u . . . i n , whereas Aj, ; = hi 3^ „■ — 8^ 4 ) accounts for 

1 •••"71 £ *l..<tn 1 1 • • • ijTI j 

the effect of dye-coloring on i\ . . .i n . For any slide (i\ . . - i n ,ji . . .j n ), which 
compares treatment combinations i\. . .i n and j\ . . .j n respectively with red 
and green dye-coloring, the expected log intensity ratio is now given by 

(10) Pil—in ~ Pjl—jn = Ti l—in ~ T jl—jn + ^h—in + ^jl—jn- 

The parameters of interest continue to be the #s , representing the main 
and interaction effects and defined with reference to the rs as in Section 
4.1. The As are, on the other hand, nuisance parameters to us. Unlike in 
the previous sections, where we took the As as zeros, now these are kept 
perfectly general. Hence, as (10) indicates, the ordering within the slides 
is no longer inconsequential. A reduced but more restrictive version of the 
model (10) will be considered briefly in Section 6. 

In the presence of dye-coloring effects, several authors, notably Yang and Speed 
(2002), advocated the use of dye-swapped experiments. It is not hard to see 
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that, under (10), any estimable contrast of the rs is estimated orthogo- 
nally to the As in such an experiment. Let do be any optimal design arising 
from Result 3, Remark 1 or Remark 2. The dye-swapped version of do, de- 
noted by dg Wap , is a design that includes both the slides {i\ . . -i n ,ji . . .j n ) 
and (ji . . 

• jni H • ■ • in) for every slide {i\ . . . i n , ji ...j n ) in d . Given the op- 
timally of do in the absence of dye-coloring effects, one may be inclined 
to recommend the use of c?Q Wap in the present setup. However, in order to 
justify this rigorously, the following questions need to be answered: 

(a) There are 2(v — 1) slides in dQ Wap . Are at least 2(v — 1) slides required 
to estimate all the 6s under (10), even when possibly nonorthogonal (to the 
As) estimation is allowed? 

(b) Under (10), will <iQ Wap be optimal, in the sense of Result 3, among all 
designs that involve 2(v — 1) slides and keep the 6s estimable? 

The possibility of nonorthogonal estimation complicates (a). Similarly, 
(b) needs careful attention because orthogonality alone does not guarantee 
optimality with 2(v — 1) slides. 

Satisfyingly, the answers to both (a) and (b) are in the affirmative. 
The following results confirm this and, hence, vindicate the proposal of 
Yang and Speed (2002) about dye-swapping. As before, the total number 
of slides is denoted by N. Also, we continue to assume that the log intensity 
ratios arising from different slides are uncorrelated and homoscedastic with 
common variance a 2 . 

Result 4. Under the model (10), at least 2(v — 1) slides are required 
to keep all the 6s estimable. 

Result 5. Let N = 2{v — 1) and consider a design that keeps all the 6s 
estimable under (10). Then for any 0j 1 „.i Il , which represents a factorial effect 
involving u factors, 

(11) Var(^... l J>a 2 2 M - 2 , 
where 6i x ...% n is the BLUE of 6i x ...% n under (10). 

Result 6. Let N = 2(v — 1) and dQ Wap be a design defined as above. 
Then, under (10), dg Wap leads to the attainment of the lower bound in (11) 
simultaneously for all the 6s. 

These results show the optimality of d™ ap in a strong sense. Although 
Result 5 resembles Result 2, its proof involves much extra work. Following 
Remarks 1 and 2, there is considerable flexibility in the choice of do and 
hence that of dQ Wap . In addition to being helpful under resource constraints, 
this facilitates the task of finding highly efficient designs when one intends 
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to use a little more than 2(v — 1) slides so as to gain degrees of freedom for 
the estimation of a 2 . For this purpose, the same heuristic approach as in 
Section 4.3 can be followed with the only change that now in step (I), all 
possibilities for <iQ Wap , corresponding to do arising from Result 3 and Remark 
1 or 2, have to be considered. As an illustration, consider the 2x2 factorial 
and let N = 8. Then, under the criterion of w-optimality (w = 1, 2 or 3), the 
above approach yields the design 

{(01, 00), (10, 00), (11,01), (00,01), (00, 10), (01, 11), (11, 10), (10, 11)}, 

which is an augmentation of d™ ap (consisting of the first six slides) and a 
dye-swapped design by itself. A complete enumeration shows that this design 
is, indeed, u>-optimal among all designs with A = 8 slides, for w = 1, 2, 3 and 
under the model (10). 

6. Concluding remarks. 

6.1. Robustness considerations. The results in this paper were obtained 
under the assumption that the log intensity ratios for a gene, arising from 
different slides, are homoscedastic and uncorrelated. A discussion on this as- 
sumption is warranted. In cDNA microarray experiments, the measurement 
error is typically swamped in biological variability. From a practical view- 
point, it is therefore appropriate to attribute the variance of an observed 
log intensity ratio arising from a slide (h . . . i n ,ji ■ ■ -jn) to components, say, 
7? in and 7? - n , representing the biological variability within the cell pop- 
ulations given by i\ . . .i n and j\ . . .j n , in addition to a component, say, 5 2 , 
due to the measurement error. Thus, this variance equals 7„ 2 , ,■ +7?, „■ + 5 2 . 
If the variance components 7? i are supposed to be equal for all cell pop- 
ulations [cf. Kerr (2003) and Altman and Hua (2006), among others] with 
common value say, 7 2 , then the log intensity ratios arising from different 
slides are homoscedastic with common variance a 2 = 2^ 2 + 5 2 . Furthermore, 
these ratios can be safely assumed to be uncorrelated if the replications for 
every treatment combination are biological (i.e., the same subject does not 
appear in more than one slide). Thus, in this situation all our results go 
through with a 2 = 2j 2 + 5 2 . 

If, however, the variance components 7? i associated with the cell popu- 
lations are not all equal, then the assumption of homoscedasticity no longer 
holds. In order to give a flavor of the robustness of our results to this pos- 
sibility, we revisit Sections 3 and 4. For the 2x2 factorial in Section 3, 
writing j 2 ii2 = J 2 li2 /S 2 , three patterns are considered for (7oo;7oi'7iO'7ii) : 
(i) (2,2.5,2.5,3), (ii) (2,3,4,6) and (iii) (6,4,3,2). Under (i)-(iii), one can 
employ the approximate design theory to find the w-optimal design mea- 
sures and, hence, obtain Table 3 showing the efficiencies of the design mea- 
sure ttq reported earlier in Result 1(a). It is satisfying to note that ttq, 
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Table 3 

Efficiencies of no under heteroscedasticity 



Situation 


w = 1 


w = 2 


w = 3 


(0 


99.90% 


99.89% 


99.86% 


(ii) 


99.25% 


99.03% 


98.91% 


(iii) 


99.43% 


99.15% 


99.00% 



which is ^-optimal under homoscedasticity, remains quite robust even to 
the appreciably heteroscedastic situations (ii) and (iii). The exact designs 
arising from ttq are also seen to remain highly efficient under (i)-(iii). The 
findings are almost equally encouraging for the nearly saturated optimal ex- 
act designs shown in Tables 1 and 2. Under (i)-(iii) and for w = 1,2,3, 
the designs in Table 1 often remain ui-optimal among all exact designs 
and, except in one case, always have efficiency over 97%. The exceptional 
case concerns the design for N = 6, which has efficiency 93.40% under (ii) 
when w = 3. For the 2x3 factorial, along the line of (i)-(iii), we consid- 
ered the patterns (2,2.5,2.5,2.5,3,3), (2,3,3,4,6,6) and (6,4,4,3,2,2) for 
(7ooj7oij7o2j7iO'7i1'7i2)- Under all these patterns and for w = 1,2,3, the 
designs in Table 2 often turn out to be lu-optimal and always have efficiency 
over 98%. 

The log intensity ratios from different slides, of course, get correlated when 
the same subject is allowed to appear in more than one slide. If we continue 
to assume the equality of the 7^ 1 ...j n and denote their common value by 7 2 , 
then the correlation terms depend on the ratio j 2 /6 2 , which is commonly 
unknown. As a result, the standard linear model based analysis and the asso- 
ciated optimal design theory will not work. On the other hand, if we pretend 
7 2 /5 2 to be known so as to allow the use of weighted least squares, empir- 
ical evidence suggests in favor of having only biological replications from 
the point of view of efficiency. To illustrate this point without making the 
presentation too involved, we consider the case of a 2 x 2 factorial design in 
N = 4 slides under the absence of dye color effects. We made an enumeration 
of all such designs that keep the main and interaction effects estimable and, 
for each design and every treatment combination, enumerated all possibili- 
ties for biological or technical replication (here technical replication means 
repeating the same subject on more than one slide). For instance, in the 
design d* = {(01, 00), (10, 00), (11, 01), (11, 10)}, the two replications for any 
treatment combination can be biological or technical, thus leading to 16 pos- 
sibilities arising from this design alone. The complete enumeration revealed 
that, if the ratio j 2 /5 2 is pretended to be known, then irrespective of the 
value of this ratio, the design d*, with all replicates biological, is zu-optimal 
whenever w > |. Earlier, in Table 1, the same design was reported to be 
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^-optimal for w = 1,2 and 3 in the homoscedastic and uncorrelated setup. 
Complete enumeration of this kind becomes unmanageable for more complex 
factorials, but partial enumeration in several other situations led to similar 
conclusions. This reinforces the findings in Kerr (2003) in a simpler setting 
and suggests that, in addition to making the log intensity ratios from differ- 
ent slides uncorrelated, use of only biological replicates can be advantageous 
from the perspective of design efficiency as well; see also Kendziorski et al. 
(2005) and the references therein for insightful practical results in a similar 
context. The point just noted makes sense if the cost of biological replication 
is negligible compared to the cost of the assay per slide, as has been tac- 
itly supposed in this paper. While Bueno Filho, Gilmour and Rosa (2006) 
mention that the number of slides is typically the most important limiting 
factor in microarray experiments, a more detailed discussion in this regard 
is available in Kerr (2003), who also dwelt on the situation where this is 
not the case. If the cost of biological replication is a real issue, then the 
design problem becomes much more complex. Instead of fixing the number 
of slides, as done here, one should then proceed in the spirit of Kerr (2003) 
to formulate the problem in terms of a cost function that incorporates the 
cost of the assays (slides), as well as the cost of biological replication. Given 
such a cost constraint, the possibility of technical replication and associated 
correlation will also have to be accounted for. Since commonly this correla- 
tion is unknown, the optimal design problem will then concern some kind of 
likelihood based rather than linear model based analysis. 

6.2. Further open issues. Even within the homoscedastic and uncorre- 
lated setting, there are several open issues that deserve attention. One of 
these concerns analytical derivation of optimal designs for N greater than 
v — 1 or 2(v — 1) in Sections 4 or 5 respectively. For instance, a proof of 
the Conjecture in Section 4.3 will be of interest. This can, however, be chal- 
lenging, and pending a complete solution, our heuristic approach holds the 
promise of yielding designs that are at least highly efficient. 

From a practical point of view, an important design problem is that of 
fractional replication. Compared to traditional factorials, a difficulty here is 
the lack of effect hierarchy [Wu and Hamada (2000), page 112]. Even in the 
two-factor case, the interaction can be of greater interest to biologists than 
the main effects. Hence, especially when the number of factors, n, is rela- 
tively small, it may be too drastic to ignore some interactions, as required 
in fractional replication. For large n, however, this can be a sensible option. 
The experience with factorial fractions under the orthogonal parametriza- 
tion [see, e.g., Dey and Mukerjee (1999)] suggests that then, under specifica- 
tion from biologists about the pattern of negligible interactions, the present 
techniques should be useful. 
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A problem, akin to that of fractional replication, concerns the study of 
optimal deigns when the impact of possible dye-color bias can be modeled 
via a reduced version of (10). Note that the model (10) allows a very general 
form for the effect of dye-coloring and, hence, is applicable to a broad spec- 
trum of situations. If in a specific application one has sufficient knowledge 
of the underlying process so as to entertain the risk of assuming that such 
effect is repeatable over slides, that is, additive to treatment effects, then in 
(10) one can replace A^...^ + A^...^ by a single parameter rj. In this case, it 
can be shown that at least v slides are required to keep all the 9s estimable. 
However, in contrast with Results 3 and 6, no single design with v slides is 
optimal simultaneously for all these parameters. For this reduced model, it 
is known that any even design (i.e., a design where every treatment combi- 
nation appears an even number of times) allows a dye-color assignment that 
ensures orthogonality to r\ [Kerr and Churchill (2001a)]. For even s\ and 
S2, the design in Conjecture (a) of Section 4.3 is even and, hence, with ap- 
propriate dye-color assignment, it is again conjectured to be optimal under 
this model. For odd s\ or S2 too, the initial findings are optimistic. Thus, 
for the 2x3 factorial, Conjecture (a) yields the nearly orthogonal design 
{(00, 01), (02, 00), (10, 00), (01, 11), (11, 10), (12, 02), (10, 12)}, and a complete 
enumeration shows that it is, indeed, ^-optimal among all designs with 7 
slides, for w = 1,2,3 and under the reduced model. 

In the present paper we studied optimal designs from the statistical con- 
sideration of efficiency. From this perspective, our designs often outperform 
more elementary ones that have gained popularity in applied work. For in- 
stance, in the setup of Sections 4 and 5, it is easy to check that the designs 
arising from Results 3 and 6 estimate the main effect parameters with the 
same variance and entail smaller variances for the interaction parameters, as 
compared to the commonly used reference design or the dye-swapped ver- 
sion thereof, respectively. These simpler designs may, however, have other 
practical benefits, including those dictated by manufacturer recommenda- 
tions. Nevertheless, as noted earlier, our results allow considerable flexibility 
under resource constraints and should be useful to applied researchers con- 
cerned with practical issues in addition to efficiency considerations. Even in 
extreme situations where such practicalities preclude direct implementation 
of the proposed designs, the latter would help in benchmarking the designs 
actually used from the point of view of efficiency. 

It is hoped that the present endeavor will generate further interest in the 
above directions. 
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SUPPLEMENTARY MATERIAL 

Optimal factorial designs for CDNA microarray experiments: Proofs (doi: 
10.1214/07- AOAS144SUPP; .pdf). Technical details, including proofs, ap- 
pear in a supplementary material file posted at the journal website. 
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